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Abstract. - The superfamily phenomenon of time series with different dynamics can be charac- 
terized by the motif rank patterns observed in the nearest-neighbor networks of the time series 
in phase space. However, the determinants of superfamily classification are unclear. We attack 
this problem by studying the influence of linear temporal correlations and multifractality using 
fractional Brownian motions (FBMs) and multifractal random walks (MRWs). Numerical inves- 
tigations unveil that the classification of superfamily phenomenon is uniquely determined by the 
detrended fluctuation analysis (DFA) scaling exponent a of the time series. Only four motif pat- 
terns are observed in the simulated data, which are delimited by three DFA scaling exponents 
q ~ 0.25, a ~ 0.35 and a ~ 0.45. The validity of the result is confirmed by stock market indexes 
and turbulence velocity signals. 



Introduction. — In the last few decades, the un- 
derstanding of the dynamics and the topological struc- 
tures of complex systems has experienced extremely rapid 
progress. The behavior of a complex system is usually 
recorded and exhibited in the form of time series. New 
light has been shed on these studies from the perspective 
of complex networks [1-3], which considers the interac- 
tions of different constituents of a complex system though 
a portfolio of time series. More recently, a wealth of meth- 
ods have been proposed to construct complex networks 
from single time series [4-15], which enables us to study 
the dynamics of a complex system through the topological 
structure of the constructed network. These methods have 
been applied to investigate diverse natural and social sys- 
tems [16-24], among which the visibility graph algorithm 
has attracted most attention [10,11,19-24]. 

Many complex biological, technological and social net- 
works share similar universal statistical properties, such as 
the small world characteristics [25, 26] and the scale- free 
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distribution of node degrees [27]. The global properties 
can be used to classify networks into different universality 
classes from a macroscopic point view [26]. On the other 
hand, complex networks with the same global statistics 
may exhibit different local structure properties. At the 
microscopic level, the building blocks of complex networks 
are subgraphs or motifs [28], and the network motif pat- 
terns of occurrence can be utilized to define supcrfamilics 
of networks [29] . The superfamily phenomenon is also ob- 
served in time series, which is able to distinguish different 
types of dynamics in periodic, chaotic and noisy processes 
based on the occurrence frequency patterns of network 
motifs [9]. 

In this Letter, we attempt to seek possible determinants 
that cause different motif rank patterns emerging in com- 
plex nearest-neighbor networks constructed from a specific 
time scries whose increments are stationary. Numerical 
investigations conducted on fractional Brownian motions 
(FBMs) and multifractal random walks (MRWs) uncover 
that the superfamily phenomenon is uniquely determined 
by the DFA scaling exponent and the multifractality in 
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MRWs has negligible impact. We also study the superfam- 
ily phenomenon using several stock indexes of developed 
and emerging markets and the velocity signals of three- 
dimensional fully developed turbulent flows, and find that 
the above numerical conclusion holds as well. 

Nearest neighbor networks and motifs. The 

procedure to construct a nearest-neighbor network from 
a time scries is described as follows [9]. The time series 
is embedded in a proper phase space, in which the time 
delay r is calculated according to the mutual information 
method [30] and the embedding dimension d should be 
larger than 2D + 1 to make sure that the attractors are 
unfolded, where D is the correlation dimension of the sys- 
tem. Each point in the phase space is considered as a node 
of the network. For each node, four links are generated to 
its nearest neighbors that have not been connected to it. 
In this way, a nearest-neighbor network is constructed. 

For a time series with length £, the nearest-neighbor net- 
work has n = £ — dr nodes and the average degree of node 
is 8. In this Letter, we choose £ = 10 4 and d = 10, and 
the results presented are averaged over many realizations. 
Certainly, different orders of chosen nodes in construct- 
ing connections result in different networks. However, our 
results are independent of the node order. Since all the 
constructed networks have the same size, we can directly 
compare the motif distributions without any reference to 
random networks [9,28,29]. 

Following Ref. [9], we consider motifs with four nodes. 
Fig. [1] shows all possible motifs that are connected undi- 
rected graphs containing 4 four nodes. In general, different 
networks have different motif distributions, and it is not 
necessary that all motifs appear. For instance, a chain- 
like network or a loop has only motif A, a star-like graph 
has only motif B, and a complete graph has only motif 
F. It is argued that distinct systems with the same motif 
occurrence rank may share some similar key elements and 
belong to a same superfamily [9,29]. 




Fig. 1: All six network motifs of size 4 in undirected nearest- 
neighbor networks. These motifs are labeled A, B, C, D, E, 
and F, respectively. 



Numerical investigation of superfamily phe- 
nomenon in FBMs and MRWs. We synthesize 
FBMs using wavelet-based algorithm to simulate FBMs 
[31] and MRWs [32]. For each input Hurst index H, which 
runs from 0.05 to 0.95 with a step of 0.05, 100 realizations 
of FBMs and MRWs are generated, and the corresponding 
motif distribution for each time series is obtained. Fig. [2] 
illustrates the dependence of the averaged motif ranks in 
descending order of the occurrence frequency with respect 



to the Hurst index H for the two types of random walks. 
The insets of Fig. [5] compare the DFA scaling exponent a 
of the sample series and the input Hurst index H for the 
FBMs and MRWs based on 100 realizations. Specifically, 
for each FBM (MRW) series with input Hurst index H, 
we calculate the DFA scaling exponent a based on the de- 
trended fluctuation analysis [33,34]. It is found that a is 
very close to H for the FBMs, while a is almost identical 
to H when H ^ 0.45 but systematically deviates from H 
when H < 0.45 for the MRWs [19]. 
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Fig. 2: Network motif ranks of fractional Brownian motions (a) 
and multifractal random walks (b) with different Hurst indexes 
H . The two insets compare the DFA scaling exponent a of the 
sample series and the input Hurst index H for the FBMs and 
MRWs based on 100 realizations. 

We observe that clear patterns emerge in Fig. [5] For 
the FBMs, there are four motif rank patterns: ABCDEF 
when H < 0.20, ACBDFE when 0.25 < H ^ 0.30, 
ACDBFE when 0.35 H < 0.40, and ACDFBE when 
0.45 ^ H. For the MRWs, there are three motif rank 
patterns: ACBDFE when H ^ 0.30, ACDBFE when 
0.35 sC H ^ 0.40, and ACDFBE when 0.45 sC H. Strik- 
ingly, when we compare the two cases using the DFA scal- 
ing exponent a, the two plots tell the same story: The 
superfamily classification is determined uniquely by the 
DFA scaling exponent. To be more clear, for H ^ 0.30 
in the case of MRWs in Fig. [2{b), a is less than 0.35 and 
greater than 0.25, which corresponds to the motif rank 
pattern ACBDFE in Fig. Eta). It is not surprising that 
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the pattern ABCDEF is missing in the case of MRWs, 
since it is hard to obtain MRW series with the DFA scal- 
ing exponent a smaller than 0.20. We can thus focus on 
the FBM case in the following discussion. 

In Fig. [21 we identify two key motifs, B and F, whose 
locations determine the associated motif rank patterns. If 
we ignore B and F, the four patterns reduce to the same 
combination ACDE. This phenomenon is also observed in 
all the cases presented in Ref. [9] . The relative occurrence 
frequency (or rank) of motif B decreases with the increase 
of the DFA scaling exponent, and the rank of motif F 
increases with the DFA scaling exponent. We also find 
that, with the increase of the the DFA scaling exponent, 
the ranks of C and D increase and the rank of E decreases. 
These trends can be quantitatively explained as follows, 
especially for B and F. 

Motif B occurs when the neighbors of the point are all 
far from each other, while motif F occurs when the four 
points are all close enough to each other. Therefore, mo- 
tif B represents the most irregular structure while motif 
F indicates the most transitive condition. For large DFA 
scaling exponent, the time series is relatively more smooth. 
The consequence is that data points with close time mo- 
ments in the time series are mapped to close points in the 
phase space that are more probable to connect with each 
other. More F motifs is expected to appear. For small 
DFA scaling exponent, the time series is relatively more 
rough. It follows that close points in the time series are 
mapped to a sparse region in phase space. In the sparse 
region, nontransitivc structures such as motif B will be 
common [9]. 

Superfamily phenomenon in real systems. We 

have shown above that correlated time series have a unique 
superfamily with a specific motif rank pattern ACDFBE 
while anticorrelated time series may exhibit four different 
motif rank patterns based on the associated DFA scal- 
ing exponents. In this section, we investigate the super- 
family phenomenon in two real systems using stock mar- 
ket indexes and turbulence velocity. The similarity be- 
tween stock markets and turbulence has been extensively 
studied [35,36], which has stimulated the development of 
Econophysics [37]. This comparative study aims to check 
if these two systems belong to the same superfamily. 

Example 1: Turbulence velocity signals. The veloc- 
ity data set in three-dimensional fully developed turbu- 
lence have been obtained from a hot-wire probe in the 
SI wind tunnel of ONERA by the Grenoble group from 
LEGI [38]. The mean velocity of the main flow is (v) = 
20ms~ 1 , and the root mean square velocity fluctuation is 
Vxms — 1.684ms -1 . Thus the turbulence intensity is cal- 
culated to be / = w r ms/(^} = 0.0842 and the velocity is 
large enough compared with turbulent fluctuations to use 
the Taylor's frozen flow hypothesis. The integral scale 
« 2m is obtained from the autocorrelation of the velocity 
series (according to the Taylor's frozen flow hypothesis). 
The Kolmogorov microscale rj is 0.195 mm, the Taylor 



microscale A = 16.6 mm, and the Taylor-scale Reynolds 
number is ReA = 1863 [39]. 

The measurement resolution of the experiment is 3.55 x 
10 _5 s. The total length of the velocity signal is about L = 
1.7 x 10 7 . The signal is sampled at different time scales St 
(in units of the measurement resolution). In our analysis, 
St ranges from 1 to 200 with an increment of 1. For each St, 
the length of the sampled signal is about L$t = L/St, and 
100 randomly subseries with length I = 10 4 are extracted 
for analysis. The estimated DFA scaling exponents a are 
shown in Fig.[3]Ja) as a function of the sampling time scale 
St, and the motif rank patterns for each St are illustrated 
in Fig. Hb). 
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Fig. 3: (a) Dependence of the DFA scaling exponent a of the 
resampled turbulence signals with respect to the time scale St. 
(b) Dependence of the motif rank pattern on the resampling 
interval St. The values with St larger than 120 are not shown. 

In Fig. [3Jb) , we observe five motif rank orders: 
ACDFBE for St = 1, ACDBFE for 2 s$ St sC 4, ACBDFE 
for 5 St s$ 64, ABCDFE for 65 «S St < 82, and ABCDEF 
for 83 ^ St. Taking into consideration the fact that the 
motif rank pattern ABCDFE is very close to ABCDEF as 
we will show below, it is found that, with the decrease 
of St (thus the increase of a), the motif rank pattern 
changes in an order of ABCDEF, ACBDFE, ACDBFE, 
and ACDFBE. Furthermore, the switching values of a 
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Fig. 4: Network motif ranks of turbulence signals with different sampling time scales: (a) St = 1, (b) 2 ^ St ^ 4, (c) 5 ^ St ^ 64, 
(d) 65 < St 82, and (e) 83 sC St. 



from one pattern to the other are also in excellent agree- 
ment with Fig. [2j Fig. [3] also implies that the turbulence 
velocity signal possesses multiscale nature. However, the 
switching values of 5t cannot be related to the well-known 
turbulence scales (such as the dissipation subrange and 
the inertial subrange). 

Plots (a-e) of Fig. 0] illustrate typical motif ranks in 
descending order for the networks induced from velocity 
scries at different time scales. Fig. HJc) shows that the 
occurrence frequency of motif B (and motif A as well) de- 
creases with a and the frequency of motif F (and D as well) 
increases with a, as explained for the FBM case. Fig. |3Je) 
confirms this observation since the occurrence frequency of 
motif F decreases with 8t and thus increases with a. There 
are also time series (say, St = 55 in Fig. |4jc) and St = 75 
in Fig. IDJd)) that have different motif rank patterns with 
their "neighbors" . However, this situation is rare, and av- 
eraging over 100 realizations with the same St gives con- 
sistent and stable results. In addition, we find that the 
relative frequencies of motif E and F are comparable in 
Fig. [Ud), which makes the motif rank pattern ABCDFE 
marginal when compared with the pattern ABCDEF. 

The above observations are further illustrated in 
Fig- Elf)- We find that, with the increase of St, the motif 
frequencies of A, B, C and E increase, while the motif fre- 
quencies of D and F decrease. There are two decreasing 
quatcrnionic combinations, ACDE and ACDF. In addi- 
tion, there are four intersections caused by B and C, B 
and E, B and F, and E and F, which correspond to the 
four switching values of St separating the five motif rank 
patterns shown in Fig. HJa-e) . 



Example 2: Stock market indexes. Now we study the 
superfamily phenomenon of three stock market indexes: 
The DJIA from 26-May-1896 to 27-Apr-2007 of size 30147, 
the S&P 500 index from 29-Nov-1940 to 07-Jan-2008 of 
size 16943, and the German DAX index from Ol-Oct-1959 
to 15-Nov-2002 of size 10812. For these three data sets, it 
is hard to perform a similar analysis as for the turbulence 
signal due to the short length of the data. Therefore, we 
do not perform multiscale analysis on the stock index data. 
We convert the time series into three nearest-neighbor net- 
works in the phase space and count the occurrence of the 
six motifs. The motif ranks for the three stock indexes are 
shown in Fig. [5] 




Fig. 5: Network motif ranks of the nearest-neighbor networks 
induced from three stock market indexes. 
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According to Fig. [5l the three stock indexes have the 
same motif ranks ACDFBE. This is consistent with the 
regime of a ^ 0.45 in the fractional Brownian motion case 
illustrated in Fig. [^a). It is well-known that stock index 
returns are basically uncorrelated with the DFA scaling 
exponent a « 0.5 [37]. It is evident that the supcrfamily 
phenomenon of stock indexes confirms the main conclusion 
of this Letter. 

Conclusion. — Time series can be mapped into 
nearest-neighbor networks, which allows us to classify time 
series into different superfamilies based on the motif occur- 
rence patterns. We have investigated the motif patterns 
of fractional Brownian motions and multifractal random 
walks and found that the DFA scaling exponent of the 
time series plays a unique role in the supcrfamily clas- 
sification. The multifractal nature of the MRWs has no 
influence on the motif ranks. Four different motif rank 
patterns ABCDEF, ACBDFE, ACDBFE, and ACDFBE 
are identified, which correspond to different ranges of the 
DFA scaling exponent. The results of similar analyses of 
velocity time series in fully developed turbulence and daily 
stock market indexes are essentially consistent with this 
finding, except that an additional pattern ABCDEF ap- 
pears between ABCDEF and ACBDFE with a very narrow 
range of the exponent a for the turbulence case. 
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